RAW SEQUENCE LISTING 
ERROR REPORT 


BIOTECHNOLOGY 

SYSTEMS 



The Biotechnology Systems Branch of the Scientific and Technical Information Center 
(STIC) detected errors when processing the following CRF diskette: 

Application Serial Number: 0 

Art Unit / Team No.: _ 

Date Processed by STIC: _ 


l/nMJoP 

1^9 , —_ 

!fitrLoofi 


THE ATTACHED PRINTOUT EXPLAINS THE ERRORS DETECTED. 

PLEASE BE SURE TO FORWARD THIS INFORMATION TO THE APPLICANTS 
BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT 
COMMUNICATION TO THE APPLICANTS ALONG WITH A NOTICE TO 
COMPLY or, 

2) CALLING APPLICANTS AND FAXING THEM A COPY OF THE PRINTOUT 
WITH A NOTICE TO COMPLY 

THIS WILL INSURE THAT THE NEXT SUBMISSION RECEIVED FROM THEM 
WILL BE ERROR FREE. 

IF YOU HAVE ANY FURTHER QUESTIONS, PLEASE CALL: 


MARK SPENCER 703-308^4212 
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RAW SEQUENCE LISTING 
PATENT APPLICATION US/09/049,304 


DATE: 02/02/2000 
TIME: 22:09:34 


INPUT SET; S34610.raw 


This Raw Listing contains the General 
Information Section and those Sequences 
containing ERRORS. 


SEQUENCE LISTING 

(1) General Information: 

(i) APPLICANT :EPELBAUM, SABINE URSULA 
FALCO, SAVERIO CARL 
MCDEVITT, RAYMOND ERVIN, III 

(ii) TITLE OF INVENTION:CHIMERIC GENES AND METHODS FOR 
INCREASING THE LYSINE CONTENT OF 

THE SEEDS OF PLANTS 

(iii) NUMBER OF SEQUENCES: 132 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: E. I. DU PONT DE NEMOURS AND COMPANY 

(B) STREET: 1007 MARKET STREET 

(C) CITY: WILMINGTON 

(D) STATE: DELAWARE 

(E) COUNTRY: U.S.A. 

(F) ZIP: 19898 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: DISKETTE, 3.50 INCH 

(B) COMPUTER: IBM PC COMPATIBLE 

(C) OPERATING SYSTEM: MICROSOFT OFFICE 97 

(D) SOFTWARE: MICROSOFT WINDOWS 95 

(Vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vii)PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/824,627 

(B) FILING DATE: MARCH 27, 1997 

(vi i i) ATTORNEY/AGENT INFORMATION: 

(A) NAME: CHRISTENBURY, LYNNE M. 

(B) REGISTRATION NUMBER: 30,971 

(C) REFERENCE/DOCKET NUMBER: BB-1037-F 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 302-992-5481 

(B) TELEFAX: 302-892-7949 


Does Not Comply 
Corrected Diskette Needed 



PAGE: 2 RAW SEQUENCE LISTING 

PATENT APPLICATION US/09/049,304 

, , INPUT SET: 

46 (C)TELEX: 835420 

47 

48 


ERRORED SEQUENCES FOLLOW: 
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(2) INFORMATION FOR SEQ ID NO: 6 : 

(i) ■ SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 917 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 3..911 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6 : 

CC ATG GCT ACA GGT TTA ACA GCT AAG ACC GGA GTA GAG CAC TTC GGC 
Met Ala Thr Gly Leu Thr Ala Lys Thr Gly Val Glu His Phe Gly 
1 5 10 15 

ACC GTT GGA GTA GCA ATG GTT ACT CCA TTC ACG GAA TCC GGA GAC ATC 
Thr Val Gly Val Ala Met Val Thr Pro Phe Thr Glu Ser Gly Asp lie 
20 25 30 

GAT ATC GCT GCT GGC CGC GAA GTC GCG GCT TAT TTG GTT GAT AAG GGC 
Asp lie Ala Ala Gly Arg Glu Val Ala Ala Tyr Leu Val Asp Lys Gly 
' 35 40 45 

TTG GAT TCT TTG GTT CTC GCG GGC ACC ACT GGT GAA TCC CCA ACG ACA 
Leu Asp Ser Leu Val Leu Ala Gly Thr Thr Gly Glu Ser Pro Thr Thr 
50 55 60 

ACC GCC GCT GAA AAA CTA GAA CTG CTC AAG GCC GTT CGT GAG GAA GTT 
Thr Ala Ala Glu Lys Leu Glu Leu Leu Lys Ala Val Arg Glu Glu Val 
65 70 75 

GGG GAT CGG GCG AAG CTC ATC GCC GGT GTC GGA ACC AAC AAC ACG CGG 
Gly Asp Arg Ala Lys Leu lie Ala Gly Val Gly Thr Asn Asn Thr Arg 
80 85 90 95 

ACA TCT GTG GAA CTT GCG GAA GCT GCT GCT TCT GCT GGC GCA GAC GGC 
Thr Ser Val Glu Leu Ala Glu Ala Ala Ala Ser Ala Gly Ala Asp Gly 
100 105 110 



DATE: 02/02/2000 
TIME: 22:09:35 

S34610.raw 
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RAW SEQUENCE LISTING 
PATENT APPLICATION US/09/049,304 


DATE: 02/02/2000 
TIME: 22:09:35 


INPUT SET: S34610.raw 

CTT TTA GTT GTA ACT CCT TAT TAC TCC AAG CCG AGC CAA GAG GGA TTG 383 

Leu Leu Val Val Thr Pro Tyr Tyr Ser Lys Pro Ser Gin Glu Gly Leu 

115 120 125 

CTG GCG CAC TTC GGT GCA ATT GCT GCA GCA ACA GAG GTT CCA ATT TGT 431 

Leu Ala His Phe Gly Ala lie Ala Ala Ala Thr Glu Val Pro lie Cys 

130 135 140 

CTC TAT GAC ATT CCT GGT CGG TCA GGT ATT CCA ATT GAG TCT GAT ACC 479 

Leu Tyr Asp lie Pro Gly Arg Ser Gly lie Pro lie Glu Ser Asp Thr 

145 150 155 

ATG AGA CGC CTG AGT GAA TTA CCT ACG ATT TTG GCG GTC AAG GAC GCC 527 

Met Arg Arg Leu Ser Glu Leu Pro Thr He Leu Ala Val Lys Asp Ala 

160 165 170 175 

AAG GGT GAC CTC GTT GCA GCC ACG TCA TTG ATC AAA GAA ACG GGA CTT 575 

Lys Gly Asp Leu Val Ala Ala Thr Ser Leu lie Lys Glu Thr Gly Leu 

180 185 190 

GCC TGG TAT TCA GGC GAT GAC CCA CTA AAC CTT GTT TGG CTT GCT TTG 623 

Ala Trp Tyr Ser Gly Asp Asp Pro Leu Asn Leu Val Trp Leu Ala Leu 

195 200 205 


GGC GGA TCA GGT TTC ATT TCC GTA ATT GGA CAT GCA GCC CCC ACA GCA 671 
Gly Gly Ser Gly Phe lie Ser Val lie Gly His Ala Ala Pro Thr Ala 
210 215 220 


TTA CGT GAG TTG TAC ACA AGC TTC GAG GAA GGC GAC CTC GTC CGT GCG 719 
Leu Arg Glu Leu Tyr Thr Ser Phe Glu Glu Gly Asp Leu Val Arg Ala 
225 230 235 


CGG GAA ATC AAC GCC AAA CTA TCA CCG CTG GTA GCT GCC CAA GGT CGC 767 

Arg Glu lie Asn Ala Lys Leu Ser Pro Leu Val Ala Ala Gin Gly Arg 

240 245 250 255 

TTG GGT GGA GTC AGC TTG GCA AAA GCT GCT CTG CGT CTG CAG GGC ATC 815 

Leu Gly Gly Val Ser Leu Ala Lys Ala Ala Leu Arg Leu Gin Gly lie 

260 265 270 


AAC GTA GGA GAT CCT CGA CTT CCA ATT ATG GCT CCA AAT GAG CAG GAA 863 

Asn Val Gly Asp Pro Arg Leu Pro lie Met Ala Pro Asn Glu Gin Glu 

275 280 285 

CTT GAG GCT CTC CGA GAA GAC ATG AAA AAA GCT GGA GTT CTA TAA TGAGAATTC 

Leu Glu Ala Leu Arg Glu Asp Met Lys Lys Ala Gly Val Leu *' 

290 295 300 



1982 (2) INFORMATION FOR SEQ ID NO: 76: 

1983 

1984 (i) SEQUENCE CHARACTERISTICS: 

1985 (A) LENGTH: 175 base pairs 
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RAW SEQUENCE LISTING 
PATENT APPLICATION US/09/049,304 


DATE: 02/02/2000 
TIME: 22:09:35 
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, x INPUT SET: S34610.raw 

<B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: E. coli 

(G) CELL TYPE: DH5 alpha 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: 5-1 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2..172 

(D) OTHER INFORMATION: /function= "synthetic 
storage protein 
/product= "protein" 

/gene= n ssp n 
/standard_name= 

"5.5.5.7.7.7.7.5" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:76: 

C ATG GAG GAG AAG ATG AAG GCG ATG GAG GAG AAG ATG AAG GCG ATG 46 

Met Glu Glu Lys Met Lys Ala Met Glu Glu Lys Met Lys Ala Met 
1 5 10 15 

GAG GAG AAG ATG AAG GCG ATG' GAG GAA AAG CTG AAA GCG ATG GAG GAG 94 

Glu Glu Lys Met Lys Ala Met Glu Glu Lys Leu Lys Ala Met Glu Glu 

20 25 30 

AAA CTC AAG GCT ATG GAA GAA AAG CTT AAA GCG ATG GAG GAG AAA CTG 142 

Lys Leu Lys Ala Met Glu Glu Lys Leu Lys Ala Met Glu Glu Lys Leu 

35 40 45 

AAG GCC ATG GAA GAG AAG ATG AAG GCG TGATAG 
Lys Ala Met Glu Glu Lys Met Lys Ala 
50 55 



3026 

3027 

3028 
-> 3029 
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(2) INFORMATION FOR SEQ ID NO: 111 : 


(i) SEQUENCE CHARACTERISTICS-; 

(A) LENGTH: <^ 194 ba se pairs 

(B) TYPE: nucleic acTtd-^ 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111 : 
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RAW SEQUENCE LISTING 
PATENT APPLICATION US/09/049,304 


DATE: 02/02/2000 
TIME: 22:09:36 


INPUT SET: S34610.raw 


3037 

3038 

3039 

ATGAATTCAA 

ATGGCCATGA 

GGAGGAGAAG 

AAGTTGGGGA 

ATGGAGTTGT 

GGGGATTCTA 

60 

3040 

3041 

TCTGAAACAG 

TTAACAAATG 

6 GAGAGACGA 

ACACCATTGA 

CGCCATCGCA 

TTGCGCTCGC 

120 

3042 

3043 

CTTTTACACG 

GTGGGAAAGA 

CAGAACCGGC 

ATTTCCCGCA 

TTGTGGTTCA 

GCCATCTGCT 

180 

3044 

3045 

AAGCGTATCC 

ATCATGATGC 

CTTGTATGAA 

CATGTTGGGT 

GTGAAATTTC 

TGATGATTTG 

240 

3046 

3047 

TCTGATTGTG 

GGCTTATACT 

TGGAATCAAA 

CAACCTGAGC 

TAGAAATGAT 

TCTTCCAGAG 

300 

3048 

3049 

AGAGCATACG 

CTTTCTTTTC 

ACATACTCAT 

AAGGCACAGA 

AAGAGAACAT 

GCCTTTGTTG 

360 

3050 

3051 

GATAAAATTC 

TTTCTGAGAG 

AGTGACTTTG 

TGTGATTATG 

AGCTCATTGT 

TGGGGAT CAT 

420 

3052 

3053 

GGGAAACGAT 

TATTGGCGTT 

TGGTAAATAT 

GCAGGCAGAG 

CTGGTCTTGT 

TGACTTCTTA 

480 

3054 

3055 

CACGGACTTG 

GACAGCGATA 

TCTAAGTCTA 

GGATACTCAA 

CACCTTTCCT 

CTCGCTCGGT 

540 

3056 

3057 

GCATCGTATA 

TGTATTCCTC 

ATTGGCTGCT 

GCAAAAGCCG 

CTGTAATTTC 

TGTTGGTGAA 

600 

3058 

3059 

GAAATTGCAA 

GCCAGGGACT 

GCCATTAGGA 

ATCTGCCCTC 

TTGTATTTGT 

CTTCACCGGA 

660 

3060 

3061 

ACAGGAAATG 

TTTCTCTGGG 

GGCGCAAGAA 

ATTTTCAAGC 

TTCTTCCTCA 

CACTTTTGTT 

720 

3062 

GAACCAAGCA 

AACTTCCTGA 

ACTATTTGTA 

AAA 









GACAAAG GAATTAGTCA AAATGGGATT 780 
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3088 

3089 


TCAACAAAGC GAGTCTATCA AGTATATGGT TGTATTATTA CCAGCCAAGA CATGGTTGAA 
CACAAAGATC CATCAAAGTC ATTCGACAAA GCCGACTATT ATGCACACCC GGAACATTAC 
AATCCAGTTT TCCACGAAAA GATATCGCCA TATACGTCTG TTCTTGTAAA CTGTATGTAC 
TGGGAGAAGA GGTTTCCCTG TCTTCTGAGC ACAAAACAGC TTCAAGATTT AACAAAAAAA 
GGACTCCCAC TAGTAGGCAT ATGTGATATA ACTTGTGACA TCGGTGGCTC CATTGAATTT 
GTTAACCGAG CTACTTTAAT CGATTCCCCT TTCTTCAGGT TTAATCCCTC GAACAATTCA 
TACTACGATG ACATGGATGG GGATGGCGTA CTATGCATGG CTGTTGACAT TTTACCCACA 
GAATTTGCAA AAGAGGCATC CCAGCATTTT GGAGATATTC TTTCCGGATT TGTCGGTAGT 
TTGGCTTCAA TGACTGAAAT TTCAGATCTA CCAGCACATC TGAAGAGGGC TTGCATAAGC 
TATAGGGGAG AATTGACATC TTTGTATGAG TATATTCCAC GTATGAGGAA GTCAAATCCA 
GAAGAGGCAC AAGATAATAT TATCGCCAAC GGGGTTTCCA GCCAGAGAAC ATTCAACATA 
TTGGTATCTC TGAGCGGACA CCTATTTGAT AAGTTTCTGA TAAACGAAGC TCTTGATATG 
ATCGAAGCGG CTGGTGGCTC ATTTCATTTG GCTAAATGTG AACTGGGGCA GAGCGCTGAT 
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RAW SEQUENCE LISTING 
PATENT APPLICATION US/09/049,304 


DATE: 02/02/2000 
TIME: 22:09:36 


INPUT SET; S34610.raw 

GCTGAATCGT ACTCAGAACT TGAAGTTGGT GCGGATGATA AGAGAGTATT GGATCAAATC 1620 
ATTGATTCAT TAACTCGGTT AGCTAATCCA AATGAAGATT ATATATCCCC ACATAGAGAA 1680 
GCAAATAAGA TCTCACTGAA GATTGGTAAA GTCCAGCAAG AAAATGAGAT AAAAGAGAAG 1740 
CCTGAAATGA CGAAAAAATC AGGTGTTTTG ATTCTTGGTG CTGGACGTGT GTGTCGCCCA 1800 
GCTGCTGATT TCCTAGCTTC AGTTAGAACC ATTTCGTCAC AGCAATGGTA CAAAACATAT 1860 
TTCGGAGCAG ACTCTGAAGA GAAAACAGAT GTTCATGTGA TTGTCGCGTC TCTGTATCTT 1920 
AAGGATGCCA AAGAGACGGT TGAAGGTATT TCAGATGTAG AAGCAGTTCG GCTAGATGTA 1980 
TCTGATAGTG AAAGTCTCCT TAAGTATGTT TCTCAGGTTG ATGTTGTCCT AAGTTTATTA 2 040 
CCTGCAAGTT GTCATGCTGT TGTAGCAAAG ACATGCATTG AGCTGAAGAA GCATCTCGTC 2100 
ACTGCTAGCT ATGTTGATGA TGAAACGTCC ATGTTACATG AGAAGGCTAA GAGTGCTGGG 2160 
ATAACGATTC TAGGCGAAAT GGGACTGGAC CCTGGAATCG ATCACATGAT GGCGATGAAA 2220 
ATGATCAACG ATGCTCATAT CAAAAAAGGG AAAGTGAAGT CTTTTACCTC TTATTGTGGA 2280 
GGGCTTCCCT CTCCTGCTGC AGCAAATAAT CCATTAGCAT ATAAATTTAG CTGGAACCCT 2340 
GCTGGAGCAA TTCGAGCTGG TCAAAACCCC GCCAAATACA AAAGCAACGG CGACATAATA 2400 
CATGTTGATG GGAAGAATCT CTATGATTCC GCGGCAAGAT TCCGAGTACC TAATCTTCCA 2460 
GCTTTTGCAT TGGAGTGTTT TCCAAATCGT GACTCCTTGG TTTACGGGGA ACATTATGGC 2520 
ATCGAGAGCG AAGCAACAAC GATATTTCGT GGAACACTCA GATATGAAGG GTTTAGTATG 2580 
ATAATGGCAA CACTTTCGAA ACTTGGATTC TTTGACAGTG AAGCAAATCA AGTACTCTCC 2640 
ACTGGAAAGA GGATTACGTT TGGTGCTCTT TTAAGTAACA TTCTAAATAA GGATGCAGAC 2700 
AATGAATCAG AGCCCCTAGC GGGAGAAGAA GAGATAAGCA AGAGAATTAT CAAGCTTGGA 2760 
CATTCCAAGG AGACTGCAGC CAAAGCTGCC AAAACAATTG TATTCTTGGG GTTCAACGAA 2820 
GAGAGGGAGG TTCCATCACT GTGTAAAAGC GTATTTGATG CAACTTGTTA CCTAATGGAA 2880 
GAGAAACTAG CTTATTCCGG AAATGAACAG GACATGGTGC TTTTGCATCA CGAAGTAGAA 2940 
GTGGAATTCC TTGAAAGCAA ACGTATAGAG AAGCACACTG CGACTCTTTT GGAATTCGGG 3 000 
GACATCAAGA ATGGACAAAC AACAACCGCT ATGGCCAAGA CTGTTGGGAT CCCTGCAGCC 3 060 
ATTGGAGCTC TGGTGTTAAT TGAAGACAAG ATCAAGACAA GAGGAGTCTT AAGGCCTCTC 3120 
GAAGCAGAGG TGTATTTGCC AGCTTTGGAT ATATTGCAAG CATATGGTAT AAAGCTGATG 3180 
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RAW SEQUENCE LISTING 
PATENT APPLICATION US/09/049,304 


DATE: 02/02/2000 
TIME: 22:09:37 


3143 

3144 GAGAAGGCAG AATGA 
3X45 


INPUT SET: S34610.raw 

3195 


3359 

3360 

3361 
--> 3362 

3363 

3364 

3365 

3366 

3367 

3368 

3369 

3370 
--> 3371 

3372 


(2) INFORMATION FOR SEQ ID NO: 113: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 


(xi) SEQUENCE DESCRIPTION: 'SEQ ID NO:113: 


TTYTCj 


23 


jj^AYA C03AYAARGC {i^A 




Ox) 

/IJLtyb*— 
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SEQUENCE VERIFICATION REPORT 
PATENT APPLICATION US/09/049,304 


DATE: 02/02/2000 
TIME: 22:09:37 


INPUT SET: S34610.raw 


Line Error 

325 ff of Sequences for line conflicts w/ running total 

2023 it of Sequences for line conflicts w/ running total 

3029 Entered (3194) and Calc. Seq. Length (3195) differ 

3362 Entered (23) and Calc. Seq. Length (20) differ 

3371 Wrong Nucleic Acid Designator 

3371 Wrong Nucleic Acid Designator 

3371 Wrong Nucleic Acid Designator 

3371 § of Sequences for line conflicts w/ running total 


Original Text 

CTT GAG GCT CTC CGA GAA GAC ATG AAA AAA GC 
AAG GCC ATG GAA GAG AAG ATG AAG GCG TGATA 
(A)LENGTH: 3194 base pairs 
(A)LENGTH: 23 base pairs 
TTYTCICAYA CICAYAARGCICA 
TTYTCICAYA CICAYAARGC ICA 
TTYTCICAYA CICAYAARGC ICA 
TTYTCICAYA CICAYAARGC ICA 



